Mixture models for undiagnosed prevalent disease and interval-censored incident disease: applications to a cohort assembled from electronic health records.
نویسندگان
چکیده
For cost-effectiveness and efficiency, many large-scale general-purpose cohort studies are being assembled within large health-care providers who use electronic health records. Two key features of such data are that incident disease is interval-censored between irregular visits and there can be pre-existing (prevalent) disease. Because prevalent disease is not always immediately diagnosed, some disease diagnosed at later visits are actually undiagnosed prevalent disease. We consider prevalent disease as a point mass at time zero for clinical applications where there is no interest in time of prevalent disease onset. We demonstrate that the naive Kaplan-Meier cumulative risk estimator underestimates risks at early time points and overestimates later risks. We propose a general family of mixture models for undiagnosed prevalent disease and interval-censored incident disease that we call prevalence-incidence models. Parameters for parametric prevalence-incidence models, such as the logistic regression and Weibull survival (logistic-Weibull) model, are estimated by direct likelihood maximization or by EM algorithm. Non-parametric methods are proposed to calculate cumulative risks for cases without covariates. We compare naive Kaplan-Meier, logistic-Weibull, and non-parametric estimates of cumulative risk in the cervical cancer screening program at Kaiser Permanente Northern California. Kaplan-Meier provided poor estimates while the logistic-Weibull model was a close fit to the non-parametric. Our findings support our use of logistic-Weibull models to develop the risk estimates that underlie current US risk-based cervical cancer screening guidelines. Published 2017. This article has been contributed to by US Government employees and their work is in the public domain in the USA.
منابع مشابه
Validity of Cardiovascular Disease Event Ascertainment Using Linkage to UK Hospital Records
BACKGROUND Use of electronic health records for ascertainment of disease outcomes in large population-based studies holds much promise due to low costs, diminished study participant burden, and reduced selection bias. However, the validity of cardiovascular disease endpoints derived from electronic records is unclear. METHODS Participants were 7860 study members of the UK Whitehall II cohort ...
متن کاملModel Selection Based on Tracking Interval Under Unified Hybrid Censored Samples
The aim of statistical modeling is to identify the model that most closely approximates the underlying process. Akaike information criterion (AIC) is commonly used for model selection but the precise value of AIC has no direct interpretation. In this paper we use a normalization of a difference of Akaike criteria in comparing between the two rival models under unified hybrid cens...
متن کاملTracking Interval for Doubly Censored Data with Application of Plasma Droplet Spread Samples
Doubly censoring scheme, which includes left as well as right censored observations, is frequently observed in practical studies. In this paper we introduce a new interval say tracking interval for comparing the two rival models when the data are doubly censored. We obtain the asymptotic properties of maximum likelihood estimator under doubly censored data and drive a statistic for testing the ...
متن کاملطراحی و ایجاد پرونده ی الکترونیک سلامت بیماران مول هیداتیفرم و بررسی میزان تکمیل اطلاعات در پرونده های کاغذی بیماران
Background and Aim: To provide effective care, health care providers need timely and appropriate information. Electronic records provide quick access and easy management of data. The aim of this study was to develop electronic health records for patients with hydatidiform mole and evaluation of completeness of medical records Materials and Methods: This applied study was conducted in 2017. Aft...
متن کاملUse of an electronic health record to identify prevalent and incident cardiovascular disease in type 2 diabetes according to treatment strategy
BACKGROUND The increasing use of electronic health records (EHRs) in clinical practice offers the potential to investigate cardiovascular outcomes over time in patients with type 2 diabetes (T2D). OBJECTIVE To develop a methodology for identifying prevalent and incident cardiovascular disease (CVD) in patients with T2D who are candidates for therapeutic intensification of glucose-lowering the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Statistics in medicine
دوره 36 22 شماره
صفحات -
تاریخ انتشار 2017